Configuring: Horizontal Merge
Select the two datasets to merge. The placement of the first row of the first dataset cam be aligned with any row of the second dataset. Selecting to merge only overlapping entries means that only rows which contain values in both datasets will be included - this influences the size of the resulting dataset. The newly created dataset will contain the selected fields from both datasets, listed next to each other.
Worked example: Horizontal merge
Steps to configure
-
Configure merge: Configure the alignment of the rows of the two datasets and choose whether to only merge overlapping entries.
-
Select first dataset's fields: Select fields from the first dataset to be included in the created dataset.
-
Select second dataset's fields: Select fields from the second dataset to be included in the created dataset.
-
Execution: A new dataset will be created, containing the selected fields from both datasets listed next to each other.
Step 1: Configure merge
Configure the alignment of the rows of the two datasets and choose whether to only merge overlapping entries.
Align rows:
-
Select which row of dataset 1 must be aligned with which row of dataset 2.
-
The total number of dataset rows for each component dataset is listed next to the text box.
-
Empty datasets are indicated by the text [Empty Dataset].
-
The range of the edit boxes for both datasets is from 1 to the maximum integer value of 2147483647, regardless of the number of rows in the dataset.
-
Null values are inserted whenever either dataset does not have a value at the row index where the other one does.
-
Default: Row 1 of each dataset is aligned.
-
Minimum configuration: Row numbers must be listed for aligning each of the datasets.
Merge only overlapping entries:
-
Selecting this checkbox is optional.
-
When you select to merge only overlapping entries, this operation will only merge rows where both datasets contain values.
-
For example, if Dataset 1 has 20 rows, and Dataset 2 has 10 rows, and you choose to only merge overlapping entries:
-
with no offset configured: during the merge the last 10 rows of Dataset 1 will be removed.
-
offset of 5 to Dataset 2: the first 5 and last 5 rows from Dataset 1 will be removed.
-
offset of 10 to Dataset 2: the merge will remove the first 10 rows from Dataset 1.
-
-
-
The final resulting dataset will therefore only have valid entries; null values will only exist in this dataset where null values existed in either of the original datasets.
-
If this option is unchecked, all the rows from both datasets will be included in the final dataset.
-
The size of the resulting dataset will always be the same size or smaller than the largest of the original datasets.
-
Default: The checkbox is checked.
Step 2: Select first dataset's fields
Select fields from the first dataset to be included in the created dataset.
-
Default: All fields are selected.
-
Minimum configuration: At least one field must be selected.
Step 3: Select second dataset's fields
Select fields from the second dataset to be included in the created dataset.
-
Default: All fields are selected.
-
Minimum configuration: At least one field must be selected.
Step 4: Execution
A new dataset will be created, containing the selected fields from both datasets listed next to each other. This dataset will adhere to the selections made in the first step of configuration - merge offset and merge only overlapping entries.
Related topics: